Starting a Sentence in L2 German - Discourse Annotation of a Learner Corpus
نویسندگان
چکیده
Learner corpora consist of texts produced by second language (L2) learners. I We present ALeS Ko, a learner corpus of Chinese L2 learners of German and discuss the multi-layer annotation of the left sentence periphery notably the Vorfeld.
منابع مشابه
The ALeSKo learner corpus : Design – annotation – quantitative analyses
The ALesKo learner corpus is a small-scale comparable corpus consisting of two subcorpora: annotated essays by advanced Chinese learners of German and comparable essays by German native speakers. The motivation for its compilation was the investigation of discourse-related phenomena such as local coherence in second-language acquisition of German. After introducing how the texts were compiled a...
متن کاملVerb Second in Advanced L2 English: A Learner Corpus Study
The present study examines the interface between syntax and discourse-pragmatics in the production of verb second (V2) structures in a corpus of English texts by advanced L1 German and Dutch speakers. The evidence shows that the residual V2 produced by the learner groups studied is the result of a deficit at the interface rather than the transfer of narrow V2 syntax per se. The analysis offered...
متن کاملMetadiscourse Markers in a Corpus of Learner Language: The Case of Iranian EFL Learners
Different issues have been probed in learner corpus research since the late 1980s.However, taking the im- portance of meta discourse markers (MDMs) in signposting academic discourse, their use in Iranian EFL learners‟ academic essays is an area of research in need of a more serious analysis. Contributing to this line of investigation, this paper reports a corpus-based study of the use of MDMs i...
متن کاملTo what extent does sentence-internal realisation reflect discourse context? A study on word order
We compare the impact of sentenceinternal vs. sentence-external features on word order prediction in two generation settings: starting out from a discriminative surface realisation ranking model for an LFG grammar of German, we enrich the feature set with lexical chain features from the discourse context which can be robustly detected and reflect rough grammatical correlates of notions from the...
متن کاملPhonetic annotation of a non-native speech corpus
Annotating non-native speech on a phonetic level is an extremely labour-intensive task and therefore requires a proper balance between the expected benefit and the resources needed. This paper reports on the experience gained when collecting and annotating a corpus of English sentences recorded by students with Italian and German as their mother tongue. The annotated data were used intensively ...
متن کامل